Confidence measurement techniques in automatic speech recognition and dialog management
نویسنده
چکیده
Reliable confidence measures are essential to the basis of decisionmaking for enriching human-machine speech interaction with necessary intelligence in ergonomic dialog management. In addition to a survey of the state of the art in confidence measurement, this work also provides classification of methods derivated from several points of view and describes possible fields of application. The thesis includes comparative evaluation results of different computation algorithms which apply posterior probability as the hypothesis confidence measure in HMM-based speech recognition. The key contribution of the dissertation is the description of several utilization techniques that rely on confidence measurement and are intended to enhance the performance of speech recognition systems. A new confidence-guided approach is presented to control the pruning of the Viterbi search process dynamically by taking variable search quality into consideration to fit time-variant requirements. The thesis explores dialog management strategies and several aspects of improving user acceptance in speech-based applications by the use of confidence measurement.
منابع مشابه
Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...
متن کاملPartially observable Markov decision processes for spoken dialog systems
In a spoken dialog system, determining which action a machine should take in a given situation is a difficult problem because automatic speech recognition is unreliable and hence the state of the conversation can never be known with certainty. Much of the research in spoken dialog systems centres on mitigating this uncertainty and recent work has focussed on three largely disparate techniques: ...
متن کاملLikelihood ratio decoding and confidence measures for continuous speech recognition
Automatic speech recognition (ASR) systems are being integrated into a wider variety of tasks involving human{machine interaction. In evaluating these systems, however, it has become clear that more accurate means must be developed for detecting when portions of the decoded recognition hypotheses are either incorrect or represent out{of{vocabulary utterances. This paper describes the use of con...
متن کاملUsing Utterance and Semantic Level Confidence for Interactive Spoken Dialog Clarification
Spoken dialog tasks incur many errors including speech recognition errors, understanding errors, and even dialog management errors. These errors create a big gap between the user’s intention and the system’s understanding, which eventually results in a misinterpretation. To fill in the gap, people in human-to-human dialogs try to clarify the major causes of the misunderstanding to selectively c...
متن کاملProsody Modeling for Automatic Speech Recognition and Understanding
This paper summarizes statistical modeling approaches for the use of prosody (the rhythm and melody of speech) in automatic recognition and understanding of speech. We outline effective prosodic feature extraction, model architectures, and techniques to combine prosodic with lexical (word-based) information. We then survey a number of applications of the framework, and give results for automati...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008